J. Biochem. 143, 525-536 (2008) 
doi:10.1093/jb/mvm246 

Residues on the Dimer Interface of SARS Coronavirus 3C-like 
Protease: Dimer Stability Characterization and Enzyme 
Catalytic Activity Analysis 

Shuai Chen, Jian Zhang, Tiancen Hu, Kaixian Chen, Hualiang Jiang' and Xu Shen* 

Drug Discovery and Design Center, State Key Laboratory of Drug Research, Shanghai Institute of Materia 
Medica, Chinese Academy of Sciences, 555 Zuchongzhi Road, Shanghai 201203, China 


Received August 26, 2007; accepted December 18, 2007; published online January 7, 2008 


3C-like protease (3CL pro ) plays pivotal roles in the life cycle of severe acute 
respiratory syndrome coronavirus (SARS-CoV) and only the dimeric protease is 
proposed as the functional form. Guided by the crystal structure and molecular 
dynamics simulations, we performed systematic mutation analyses to identify 
residues critical for 3CL pro dimerization and activity in this study. Seven residues 
on the dimer interface were selected for evaluating their contributions to dimer 
stability and catalytic activity by biophysical and biochemical methods. These 
residues are involved in dimerization through hydrogen bonding and broadly 
located in the N-terminal finger, the 2 -helix A' of domain I, and the oxyanion loop 
near the SI substrate-binding subsite in domain II. We revealed that all seven single 
mutated proteases still have the dimeric species but the monomer-dimer equilibria 
of these mutants vary from each other, implying that these residues might contribute 
differently to the dimer stability. Such a conclusion could be further verified by the 
results that the proteolytic activities of these mutants also decrease to varying 
degrees. The present study would help us better understand the dimerization- 
activity relationship of SARS-CoV 3CL pro and afford potential information for 
designing anti-viral compounds targeting the dimer interface of the protease. 

Key words: catalytic mechanism, dimerization-activity relationship, dimer interface, 
residue-residue interactions, site-directed mutagenesis. 

Abbreviations: 3Cl pro , 3C-like protease; CD, circular dichroism; Dabcyl, 4-[[4-(dimethylamino) phenyl] azo] 
benzoic acid; EDANS, 5-[(2'-aminoethyl)-amino] naphthelenesulfonic acid; FRET, fluorescence resonance 
energy transfer; SARS-CoV, severe acute respiratory syndrome coronavirus; SEC, size-exclusion 
chromatography; WT, wild type. 


The disease of severe acute respiratory syndrome (SARS) 
broke out in China and menaced to more than 30 other 
countries from the end of 2002 to June 2003. SARS 
coronavirus (SARS-CoV) was identified as the etiological 
agent responsible for this infection ( 1, 2). SARS-CoV 
involves the largest viral RNA genome known to date, 
encompassing 29,727 nucleotides predicted to contain 14 
functional open reading frames (ORFs) (3). Two large 
5'-terminal ORFs, la and lb, encode two overlapping 
polyproteins, ppla (around 450kDa) and pplab (around 
750 kDa) necessary for viral RNA synthesis. Polyproteins 
ppla and pplab are cleaved extensively by 3C-like 
protease (3CL pro ) and a papain-like cysteine protease 
(PL2 pro ) to yield a multi-subunits protein complex 
called ‘viral replicase-transcriptase’ ( 4 ). Considering its 
functional indispensability in coronavirus life cycle, 
SARS-CoV 3CL pro has become an attractive target in 
discovering new anti-SARS agents (5). 
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The crystal structure revealed that SARS-CoV 3CL pro 
can form a dimer with the two monomers oriented 
perpendicular to one another (Fig. 1A and B) (6). Each 
monomer contains three domains: domains I (residues 
8-101) and II (residues 102-184) have six-stranded 
antiparallel (3-barrel forming a chymotrypsin fold, the 
substrate-binding pocket is located in a cleft between 
these two domains, while domain III (residues 201-303) 
is an antiparallel globular cluster of five a-helices 
connecting to domain II by a long loop region (residues 
185-200). The 16-residue loop region has been implicated 
to mediate the substrate-binding (7). Based on the crystal 
structure, the N-terminal finger (residues 1-7) might 
play an important role in both dimerization and enzy¬ 
matic activity of SARS-CoV 3CL pro . Numerous reports 
have proven that the N-terminal finger contributes well 
to dimerization of SARS-CoV 3CL pro (8-10). In addition, 
domain III has also been revealed to extensively involve 
in monomer-monomer interactions (7, 11). Furthermore, 
Hsu et al. (9) reported that the residue Arg4 and the last 
C-terminal helix (residues 293-306) are critical for 
stabilizing the dimer structure to maintain a correct 
conformation of the active site. As the crystal structures 
of different CoV 3CL proteases give similar dimeric 
structures and nearly all residues of 3CL pro involved in 


Vol. 143, No. 4, 2008 


525 


© 2008 The Japanese Biochemical Society. 



526 


S. Chen et al. 


Domain I 


Glyl 1(B) 

“ !' =fTyr 126(B) 


Domain II 


B 



Lysl37(B) 


:?Cysl28(B).AUl/ 

#Ph< 


Phc140(B) 


Domain III 


Phe3(B) 


Lcul41(B) 


f ) 07 Gly2(B) 



Phc3(A) ' 


Phe 140(A) 


. -_ jfpro' 

<fe. V^v..Phe8(A)PSerI39(B)% m 

'""J Q ^ tJi 


|Mcl6(B) 


Pro9(B)' 


Tyrl26(A)| Glyl24(A)| * J, 
Mct6(Ay| 

^ 3,0 



Pro 122(B) 
Vail 25(B) 


Alai 16(A) 




Ala7(A) J 


Glyl 1(A) 


Key 

Residues of surface 
Residues of second surface 
• Hydogen bond and its length 


• Asn277(A)t ^ 
1(A)^ ^ 




Tyr237(B) 

>n 

Lcu272(B) 



Hta JJ 


Glul4(B) 


Residues involved in hydrophobic 
contact(s) 

Corresponding atoms involved in 
hydrophobic contact(s) 


Fig. 1. The dimeric structure of SARS-CoV 3CL pro and 
extensive residue-residue interactions on the dimer 
interface. (A) A ribbon diagram for the crystal structure of 
SARS-CoV 3CL pro (PDB: 1UK2). Monomer A and B are 
represented as black and grey, respectively and the three 
domains are also labelled. The residues involved in monomer- 
monomer interactions, which were selected for subsequent single 


point mutation analyses, are shown in the bond model. The 
binding peptide substrate (MP) is also shown as the stick model. 
(B) A surface model of the protease. The two monomers are in the 
same orientation as shown in panel A. (C) The dimer interface 
between monomer A and B. The bonds and residues belonging to 
monomer A or B are labelled respectively. 


dimerization are conserved, it has been indicated that 
only the dimer is the biological functional form of SARS- 
CoV 3CL pro (12, 13). Tan et al.(14) revealed that the low 
enzymatic activity of the dissociated monomer is mainly 
due to the collapse of the oxyanion hole in the SI 
substrate-binding subsite. Since the dissociated monomer 
might be inactive, the dimer interface has been sug¬ 
gested as a potential target for rational inhibitors design 
against SARS-CoV 3CL pro (7, 15). An octapeptide inter¬ 
face inhibitor, designed according to the sequence of the 


N-terminal finger, was found to bind to SARS-CoV 
3CL pro specifically and competitively (16). 

The crystal structure (6) and molecular dynamics 
calculations (14) revealed that the dimer interface of 
SARS-CoV 3CL pro mainly consists of the interactions 
between two helical domains III of each monomer, and 
the hydrogen bonding and electrostatic interactions 
between the N-terminal finger of one monomer and 
the residues near SI substrate-binding subsite of 
the other monomer, in particular an oxyanion loop 
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Fig. 2. Residue-residue interactions. (A) Residue-residue 
interactions between the N-terminal finger and the SI 
subsite in the substrate-binding pocket of SARS-CoV 3CL pro . 
(B) Residue-residue interactions between two a-helix A' (resi¬ 
dues 10-15) of domain I in SARS-CoV 3CL pro dimer. The 
residues belonging to monomer A or B (PDB: 1UK2) are 


marked respectively. The labelled residues are shown as sticks, 
and the rest of the proteins as cartoon. Dashes represent the 
hydrogen bonds formed on the dimer interface. The hydrophobic 
interactions between the side-chain phenyl of Phe3 or 
Phel40 and the neighbouring residues are also labelled as the 
surface model. 


(residues 138-145) (Fig. 1A and C). According to the 
reported studies mentioned above, the structural integ¬ 
rity of the active site appears to be intrinsically 
connected with the presence of an intact dimer interface 
for SARS-CoV 3CL pro . To address this hypothesis, we 
performed the structure-guided mutagenesis analyses of 
the protease in this study. Totally seven residues on the 
dimer interface were selected, including three residues in 
the N-terminus (Seri, Phe3, Arg4) (Fig. 2A), two residues 


in the a-helix A' of domain I (SerlO, Glul4) (Fig. 2B), 
and two residues of the oxyanion loop near the SI 
subsite in domain II (Serl39, Phel40) (Fig. 2A). These 
residues are mainly involved in dimerization of SARS- 
CoV 3CL pro through hydrogen bonding and highly 
conserved among different CoV 3CL proteases. In the 
following, we evaluated the effects of these residues 
on dimer conformational stability and catalytic activity 
of SARS-CoV 3CL pro using various biochemical and 
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Table 1. Nucleotide sequences of the primers used for site-directed mutagenesis of SARS-CoV 3CL pro . a 


Oligonucleotide sequence (5'—»- 3') 

CATCACGGATCCACCATGGCTGGTTTTAGGAAAATGGC 

GCCATTTTCCTAAAACCAGCCATGGTGGATCCGTGATG 

CCACCATGAGTGGTGCTAGGAAAATGGCATTCCCG 

CGGGAATGCCATTTTCCTAGCACCACTCATGGTGG 

CCACCATGAGTGGTTTTGCGAAAATGGCATTCCCGTC 

GACGGGAATGCCATTTTCGCAAAACCACTCATGGTGG 

GGAAAATGGCATTCCCGGCAGGCAAAGTTGAAGG 

CCTTCAACTTTGCCTGCCGGGAATGCCATTTTCC 

CGTCAGGCAAAGTTGCAGGGTGCATGGTAC 

GTACCATGCACCCTGCAACTTTGCCTGACG 

CATACCATTAAAGGTGCTTTCCTTAATGGATCATGTGG 

CCACATGATCCATTAAGGAAAGCACCTTTAATGGTATG 

CATACCATTAAAGGTTCTGCCCTTAATGGATCATGTGG 

CCACATGATCCATTAAGGGCAGAACCTTTAATGGTATG 


Polarity Mutation introduced 


Forward 

SARS-CoV 3CL pro 

Ser 1 Ala 

Reverse 

SARS-CoV 3CL pro 

Ser 4 Ala 

Forward 

SARS-CoV 3CL pro 

Phe 3 Ala 

Reverse 

SARS-CoV 3CL pro 

Phe 3 Ala 

Forward 

SARS-CoV 3CL pro 

Arg 4 Ala 

Reverse 

SARS-CoV 3CL pro 

Arg 4 Ala 

Forward 

SARS-CoV 3CL pro Ser 10 Ala 

Reverse 

SARS-CoV 3CL pro Ser 10 Ala 

Forward 

SARS-CoV 3CL pro 

Glu 14 Ala 

Reverse 

SARS-CoV 3CL pro 

Glu 14 Ala 

Forward 

SARS-CoV 3CL pro 

Ser 139 Ala 

Reverse 

SARS-CoV 3CL pro 

Ser 139 Ala 

Forward 

SARS-CoV 3CL pro 

Phe 140 Ala 

Reverse 

SARS-CoV 3CL pro 

Phe 140 Ala 


“The mutant codons in the oligonucleotide sequences are highlighted in boldface. SARS-CoV 3CL pro amino acids are numbered continuously 
from the N-terminal residue, Ser 1 , to the C-terminal residue. Gin 303 . 


biophysical techniques. It was demonstrated that all seven 
single point mutated proteases can still form the dimer at 
varying concentrations, while the monomer-dimer equili¬ 
bria of these mutants in solution are different from that of 
the wild type protease. Furthermore the proteolytic 
activities of these mutants decreased to varying extents 
compared with the wild type protease. Although the dimer 
formation of SARS-CoV 3CL pro could not be disrupted 
completely by single point mutation, individual replace¬ 
ment of these residues by alanine might partly disrupt the 
integrality of the hydrogen bonding networks on the dimer 
interface, which perhaps induces an altered conformation 
of the substrate-binding pocket, therefore, results in the 
decrease or loss of the enzymatic activity. 

MATERIALS AND METHODS 

Simulation System —Initial coordinates for SARS-CoV 
3CL pro dimer was taken from the crystal structure (6) 
(PDB code: 1UK2). The missing residues were repaired 
using the loop search method in the Homology module of 
Insight II. For the simulation of SARS-CoV 3CL pro dimer 
in aqueous solution, the protein was first put into a 
suitably sized box, of which the minimal distance from 
the protein to the box wall was 1.5 nm. Then the box was 
solvated with the SPC water model (17). The protein/ 
water system was submitted to energy minimization. 
Later, counterions were added to the system to provide a 
neutral simulation system. The whole system was 
subsequently minimized again. 

Molecular Dynamics Simulations —Conventional molec¬ 
ular dynamics (CMD) simulations were carried out using 
the AMBER 7.0 package with NPT and periodic 
boundary conditions. The Amber Parm99 force field (18) 
was applied for the proteins. The Particle Mesh Ewald 
(PME) method (19) was employed to calculate the long- 
range electrostatics interactions. The non-bonded cutoff 
was set to 12.0 A, and the non-bonded pairs were 
updated every 25 steps. The SHAKE method (20) was 
applied to constrain all covalent bonds involving hydro¬ 
gen atoms. Each simulation was coupled to a 300 K 
thermal bath at 1.0 atm pressure by applying the 


algorithm of Berendsen (21). The temperature and 
pressure coupling parameters were set as 0.2 ps and 
0.05 ps, respectively. An integration step of 2fs was set 
up for the MD simulations. 

Materials —The restriction and modifying enzymes in 
this work were purchased from NEB. The vector pQE30 
and the bacterial strain M15 were from Qiagen. 
Isopropyl (3-D-thiogalactoside (IPTG) was purchased 
from Promega. The Ni-chelating column and low molec¬ 
ular weight marker for SDS-PAGE were purchased from 
Amersham Pharmacia Biotech. All other chemicals were 
of reagent grade or ultra-pure quality, and purchased 
from Sigma. 

Cloning, Expression and Purification of the Wild Type 
SARS-CoV 3CL pro — The wild type SARS-CoV 3CL pro was 
prepared according to our published method (22). The 
protease was highly pure according to SDS-PAGE and 
dialyzed to 20 mM Tris-HCl pH 7.5 containing 100 mM 
NaCl, 5 mM dithiothreitol (DTT) and 1 mM ethylene 
diaminetetraacetic acid (EDTA). The purified protein was 
further confirmed by N-terminal sequencing and mass 
spectrometry, and concentrated by Centriprep (Milipore). 
The protein concentration used in all experiments was 
determined from the absorbance at 280 nm (A 280 ) using a 
molar extinction coefficient (s 2 so) for the monomer of 
34,390/M cm (22, 23). 

Site-directed Mutagenesis of the Residues on the Dimer 
Interface of SARS-CoV 3CL pro —Site-directed mutagen¬ 
esis of the residues on the dimer interface of SARS-CoV 
3CL pro was processed by a modified recombinant PCR 
method. Totally seven mutated SARS-CoV 3CL pro s 
(Se^Ala, Phe 3 Ala, Arg 4 Ala, Ser 10 Ala. Glu 14 Ala, 
Ser 139 Ala and Phe 140 Ala) were prepared with the 
QuikChange site-directed mutagenesis kit (Stratagene) 
using pQE30-SARS-CoV 3CL pro as a template. The 
nucleotide sequences of the primers used for mutation 
were given in Table 1. The pQE30-SARS-CoV 3CL pro 
plasmids encoding mutated forms of SARS-CoV 3CL pro 
were verified by sequencing, and then Escherichia coli 
M15 cells were transformed by the resulting plasmids. 
The mutated proteins were expressed and purified in 
a similar procedure to that for the wild type protease. 
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The purity and structural integrity of the mutated 
proteases were analysed by SDS-PAGE, N-terminal 
sequencing and mass spectrometry. The concentrated 
proteins were stored in 20mM Tris-HCl pH 7.5, 100 mM 
NaCl, 5mM DTT, ImM EDTA, at -20°C. 

Circular Dichroism (CD) Spectroscopy—Circular 
Dichroism (CD) spectra were recorded on a JASCO-810 
spectropolarimeter. The protein sample was prepared in 
20 mM sodium phosphate pH 7.5, 100 mM NaCl at 25°C 
with concentration of 10 pM. Far-UV CD spectra from 
190 to 250 nm were collected with 1 nm band width using 
0.1cm path length cuvette, and normalized by subtract¬ 
ing the baseline recorded for the buffer. Each measure¬ 
ment was repeated thrice and the final result was the 
average of three independent scans. The Far-UV CD 
spectra of the mutated proteases were compared with 
that of the wild type SARS-CoV 3CL pro to exclude the 
possibility of structural misfolding caused by single point 
mutation. 

Fluorescence Spectroscopy —The fluorescence experi¬ 
ments were performed on a HITACHI F-2500 fluores¬ 
cence spectrophotometer. The protease sample was 
prepared in 20 mM Tris-HCl pH 7.5, 100 mM NaCl with 
concentration of 5 pM. The fluorescence emission spectra 
from 300 to 380 nm were collected after excitation at 
280 nm, and the spectral slit width was 5nm for 
excitation and emission. Fluorescence spectra of the 
wild type and mutated SARS-CoV 3CL pro s were mea¬ 
sured in a 1 ml quartz cuvette with 1 cm path length at 
25°C. All final spectra were corrected for the buffer 
contribution, and were the average of three parallel 
measurements. 

Glutaraldehyde Cross-linking SDS-PAGE —For the 
wild type and mutated SARS-CoV 3CL pro s (final concen¬ 
tration from 0.2 to 5 mg/ml in 20 mM Tris-HCl pH 7.5, 
100mM NaCl, 5mM DTT, ImM EDTA) an aliquot of 
25% (v/v) glutaraldehyde was added to make a final 
concentration of 0.05 or 0.1% glutaraldehyde. The 
samples were incubated at 25°C for 15 min followed by 
quenching the reaction with the addition of 1.0 M Tris- 
HCl pH 8.0 (0.5% v/v). Orthophosphoric acid was there¬ 
after added into the reaction mixture to result 
in precipitation of the cross-linked proteins. After 
centrifugation (12,000 r.p.m., 4°C), the precipitate was 
re-dissolved in loading buffer and heated at 100°C for 
5 min. SDS-PAGE was run with 10% gels. 

Size-exclusion Chromatography (SEC) Analysis —The 
dimer-monomer equilibria of the wild type and mutated 
SARS-CoV 3CL pro s were analysed by size-exclusion 
chromatography (SEC) on a HiLoad 16/60 Superdex 75 
prep grade column through an AKTA FPLC system 
(Amersham Biosciences). Buffer used was 20 mM Tris- 
HCl pH7.5, 100mM NaCl, 5mM DTT and ImM EDTA. 
The buffer was degassed and the column was equili¬ 
brated with the buffer before injecting protein samples. 
Protein samples with a concentration of 5 mg/ml were 
loaded on the column and then eluted with the buffer at 
a flow rate of lml/min by detection of absorbance at 
280 nm. The integrated area values of absorbance peaks 
were calibrated by AKTA FPLC evaluation software. The 
column was calibrated using a low molecular mass gel 
filtration kit (Amersham Biosciences) with four marker 
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Table 2. Potential residue-residue interactions on the 
dimer interface of SARS-CoV 3CL pro predicted by 5-ns 
molecular dynamics simulations. 3 



Hydrogen Bond 

Time occupancy (%) 

A 

B 

1 

SerlO (OG) 

SerlO (OG) 

99.1 

2 

Glyll(N) 

Glul4 (OE1/OE2) 

100 

3 

Glul4 (OE1/OE2) 

Glyll(N) 

100 

4 

Arg4(NHl/NH2) 

Glu290(OEl/OE2) 

95.7 

5 

Glu290(OEl/OE2) 

Arg4(NHl/NH2) 

96.9 

6 

Serl39(0/0G) 

Gly2(N) 

99.2 

7 

Gly2(N) 

Serl39(0/0G) 

30.3 

8 

Phe3(N) 

Serl39(0/0G) 

88.2 

9 

Serl39(0/0G) 

Phe3(N) 

43.5 

10 

Serl(OG) 

Glul66(OEl/OE2) 

98.3 

11 

Glul66(OEl/OE2) 

Serl(OG) 

55.4 

12 

Phel40(N) 

SerKO/OG) 

93.0 

13 

Serl(0/OG) 

Phel40(N) 

5.1 


a A, monomer A; B, monomer B. 

proteins: Ribonuclease A (13.7 kDa), Chymotrypsinogen 
A (25.0 kDa), Ovalbumin (43.0 kDa) and Albumin 
(67.0 kDa). 

Enzymatic Activity Assay —The catalytic activities of 
the wild type and mutated SARS-CoV 3CL pro s were 
measured by FRET-based assays using a 12-amino 
acid fluorogenic substrate, EDANS-VNSTLQSGLRK 
(Dabcyl)-M, according to our published studies (23, 24). 
During the continuously kinetic assay, the protease (final 
concentration 1 pM) was pre-incubated for 30 min at 25°C 
with the assay buffer (20 mM Tris-HCl pH 7.5, 100 mM 
NaCl, 5mM DTT and ImM EDTA), followed by the 
addition of the fluorogenic substrate (final concentration 
10 pM). The fluorescence intensity was monitored on a 
GENios microplate reader (TECAN, Mannedorf, 
Switzerland) and the instrument was first set to zero 
with the fluorogenic substrate. Cleavage of the substrate 
as a function of time was measured by the increase in 
emission fluorescence intensity upon continuous monitor¬ 
ing of reactions in a 96-well black microplate (BMG 
LABTECH, Offenburg, Germany) using wavelengths of 
340 nm and 488 nm for excitation and emission, respec¬ 
tively. The incubation of the substrate in the assay buffer 
without the protease was also performed as a control. 
Enzymatic activity was the average of three parallel 
assays and the activity of the wild type SARS-CoV 
3CL pro was taken as 100%. 

RESULTS 

Preparation of the Seven Mutated Proteases Involved in 
the Dimer Interface —To predict the key factors that 
maintain the stability of the dimer interface, 5-ns CMD 
simulations were firstly conducted on the dimer of SARS- 
CoV 3CL pro . All interactive residues between monomer A 
and B, as shown in Fig. 1C, were monitored for the time 
occupancy during the whole simulation process. The 
hydrogen bonds formed on the dimer interface were 
calculated by using HPLUS (25). Interestingly, more 
than 10 hydrogen bonding interactions occupy most time 
of simulation (Table 2), suggesting that the residues 
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involved in these interactions might possibly make well 
contributions to keep the dimer conformational stability. 
Guided by this potential information, we selected seven 
residues on the dimer interface for site-directed muta¬ 
genesis (Fig. 1A). These residues are mainly involved in 
the dimerization of SARS-CoV 3CL pro through hydrogen 
bonding and hydrophobic interactions with their side- 
chain or main-chain groups (Fig. 2), and single Ala 
substitution might perturb the entirety of the hydrogen 
bonding networks on the dimer interface. 

According to the preparation strategy previously 
reported in our lab (22), we expressed SARS-CoV 
3CL pro as an N-terminal His-tagged protein for purifica¬ 
tion convenience. Considering the results shown in 
several other publications (10, 15, 26, 27) that 

N-terminal extra amino acids, e.g. the purification 
affinity tag, might interfere with dimerization of SARS- 
CoV 3CL pro , we also constructed the protease into a 
vector without affinity tag and evaluated the dimeriza¬ 
tion feature of the un-tagged protein. While there are no 
obvious differences observed for the dimer-monomer 
equilibrium in solution between the two purified pro¬ 
teases (data not shown), thus we performed all subse¬ 
quent assays with the N-terminal His-tagged 3CL pro . 

Similar with the wild type protease, all the seven 
single point mutants were also successfully cloned and 
expressed in E. coli M15 cells. The majority of the 
proteins could be obtained in the soluble fraction of 
the cell lysate. SDS-PAGE analyses indicated that all 
mutated proteases are highly homogeneous in solution. 
Although the corresponding protein bands in SDS-PAGE 
would shift little faster than the molar marker of 
35.0 kDa, the recombinant proteins have been clearly 
identified as SARS-CoV 3CL pro with a molecular mass of 
35.8 kDa by mass spectrometric characterization (data 
not shown), in agreement with the values calculated from 
the protein sequences and the published data from our 
laboratory (22). 

Figure 3 shows the Far-UV CD spectra of the wild type 
and seven mutants of SARS-CoV 3CL pro . The spectra of 
the seven mutated proteases seem to be similar to that of 
the wild type SARS-CoV 3CL pro . All spectra give a 
positive peak at 196 nm and dual negative peaks at 209 
and 222 nm, typical of a mixture of a-helical and (3-sheet 
structures. These results indicated that all seven 
mutated proteases have well-defined secondary struc¬ 
tures and excluded the possibility of structural misfold- 
ing caused by single residue mutation. However small 
changes of the CD spectra do exist as shown in Fig. 3, 
which might be due to minor structural changes induced 
by Ala mutations. 

The fluorescence emission spectra of the wild type and 
seven mutants of SARS-CoV 3CL pro are also shown in 
Fig. 4. The emission /, max of the wild type SARS-CoV 
3CL pro is 325 nm. Similar to the wild type protease, all 
seven mutated proteins show only minor difference on 
the emission >„ max (varying from 324 nm to 327 nm), 
further demonstrating that replacement of single residue 
on the dimer interface by Ala has not changed the folding 
manner of the protease. 

Chemical Cross-linking Analyses of the Seven Mutated 
Proteases —Similar to 3CL proteases of human 



Fig. 3. CD spectra of the wild type and site-directed 
mutants for SARS-CoV 3CL pro . Far-UV CD spectra of the 
wild type and seven mutated SARS-CoV 3CL pro s at 25°C. 
Protein concentrations used in CD experiments were 10 pM 
and all protein samples were prepared in 20 mM sodium 
phosphate pH 7.5, 100 mM NaCl. The CD spectrum of the wild 
type protease is shown in black and the spectra of the mutated 
proteases are shown in light gray. 



Fig. 4. Fluorescence emission spectra of the wild type and 
site-directed mutants for SARS-CoV 3CL pro . Fluorescence 
emission spectra of the wild type and seven mutated proteases 
were recorded at 25°C after excitation at 280 nm. The protease 
samples (5pM) were prepared in 20 mM Tris-HCl pH 7.5, 
100 mM NaCl. The spectrum of the wild type protease is 
shown in black and those of the mutants are shown in light 
gray. 


coronavirus (HCoV) 229E and transmissible gastroenter¬ 
itis coronavirus (TGEV) (28), SARS-CoV 3CL pro can form 
a dimer in the crystal structure and solution (6, 13). The 
dimerization features of SARS-CoV 3CL pro have been 
successfully characterized by various biochemical and 
biophysical methods (7-9, 12). According to the published 
method (29), we first performed the chemical cross- 
linking analysis of the wild type SARS-CoV 3CL pro . 
When incubated with 0.05% glutaraldehyde, the protease 
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Fig. 5. SDS-PAGE profiles of glutaraldehyde cross-linked 3CL pro (5 mg/ml) cross-linked by 0.1% glutaraldehyde; lane 3b, 
SARS-CoV 3CL pro s. (A) Cross-linking analysis of the wild type 3CL pro (5 mg/ml) cross-linked by 0.05% glutaraldehyde; lanes 4a 
SARS-CoV 3CL pro . (B-H) Cross-linking analyses of Serl_Ala, and 4b, 3CL pro (1 mg/ml), 0.1 and 0.05% glutaraldehyde; lanes 5a 
Phe3_Ala, Arg4_Ala, Serl0_Ala, Glul4_Ala, Serl39_Ala and and 5b, 3CL pro (0.5 mg/ml), 0.1 and 0.05% glutaraldehyde; lanes 
Phel40_Ala mutants, respectively. Lane 1, untreated 3CL pro 6a and 6b, 3CL pro (0.2 mg/ml), 0.1 and 0.05% glutaraldehyde. 

(5 mg/ml); lane 2, molecular weight protein standards; lane 3a, 


at a concentration of 0.2 mg/ml displayed a form of 
monomer near 35.0 kDa with the other band correspond¬ 
ing to the dimer (Fig. 5A, lane 6b). With protein 
concentration increasing, both of the dimeric and mono¬ 
meric forms increased (Fig. 5A, lanes 5b-3b). A similar 
cross-linking pattern of the protease was observed when 
using a higher concentration of glutaraldehyde (0.1%), 
excluding the possibility of obvious artificial cross-linking 
effects (Fig. 5A, lanes 6a-3a). These results indicate that 
the wild type protease exists as a mixture of monomer 
and dimer at varying concentrations, which is consistent 
with the reported studies (7, 8). 

To preliminarily examine the effects of Ala mutations 
of the selected seven residues on dimerization of SARS- 
CoV 3CL pro , the chemical cross-linking analyses of the 
mutated proteases were also performed, respectively. 
Conclusively all seven mutated proteases displayed 
similar cross-linking patterns with the wild type protease 
(Fig. 5B-H). The dimeric form of each mutated protease 
also existed within a wide range of protein concentra¬ 
tions, suggesting that mutation of single residue on the 
dimer interface could not completely abolish the dimeric 
structure of SARS-CoV 3CL pro in solution. However 
moderate differences of dimer-monomer equilibria do 
exist among these mutants. For the Serl0_Ala and 
Glul4_Ala mutants (Fig. 5E and F), the amount of the 
dimer was relatively low compared with the wild type 
and other mutated proteases, indicating that the a-helix 
A' of domain I might be an important part of the dimer 
interface and relatively contribute more to maintain the 
dimer stability of SARS-CoV 3CL pro . In addition, we 
should note that the possibility of minor artificial cross- 
linking effects might still exist due to the appearance of 


the high-order multimers in SDS-PAGE (Fig. 5). While, 
considering the chemical cross-linking analyses of the 
wild type and mutant proteases were performed under 
exactly same experimental procedures, these analyses 
might still be convincing to preliminarily examine the 
effects of these mutations on dimerization of SARS-CoV 
3CL pro . 

SEC Analyses of the Seven Mutated Proteases —In 
order to more exactly evaluate the perturbation of 
dimer-monomer equilibrium caused by these mutations, 
we performed SEC analyses to further characterize the 
wild type and mutated SARS-CoV 3CL pro s. We used a 
protein concentration at 5 mg/ml for each run, which 
represents the highest concentration used in the cross- 
linking experiments, and the physical states correspond¬ 
ing to native monomeric and dimeric protease were 
observed. As shown in Fig. 6A, the wild type SARS-CoV 
3CL pro elutes in two peaks with the retention volumes at 
44.6 and 62.1ml. The elution profiles of four molecular 
mass marker proteins confirmed that the first peak 
might correspond to the dimer state (71.6 kDa) and the 
second peak would represent the monomeric species of 
SARS-CoV 3CL pro (35.8 kDa), in well agreement with 
the reported result (8). We also collected the fractions 
representing these two elution peaks and analysed them 
by SDS-PAGE, and the corresponding protein bands 
further indicated that both of these two peaks are SARS- 
CoV 3CL pro (data not shown). The amount of the dimer 
and monomer could be further quantified by the 
integrated area values of these two peaks and the 
dimer/monomer ratio of the wild type protease was 
estimated as 1.02 (Table 3). This observation thus 
indicates that in solution the wild type protease exhibits 
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Retention volume [ml] 


Fig. 6. Dimer-monomer equilibria of the wild type and 
site-directed mutants for SARS-CoV 3CL pro analysed by 
SEC. (A) Elution profile of the wild type SARS-CoV 3CL pro at 
neutral pH (7.5) and a concentration of 5 mg/ml; (B-(H) Elution 
profiles of Serl_Ala, Phe3_Ala, Arg4_Ala, SerlO_Ala, Glul4_Ala, 






Serl39_Ala and Phel40_Ala mutants at concentrations of 5 mg/ 
ml, respectively. Elution profiles of four marker proteins are also 
shown in arrow labels. Each protein sample was loaded to a 
HiLoad 16/60 Superdex 75 prep grade column and then eluted at 
a flow rate of lml/min with detection of absorbance at 280 nm. 
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Table 3. Elution profiles of the wild type and site-directed mutants for SARS-CoV 3CL pro in solution by SEC analyses. 


Protein 

Elution peak Dimer (ml) 

Elution peak mcmomer (ml) 

Dimer (%) a 

Monomer (%) a 

D (%)/M (%) 

WT SARS-CoV 3CL pro 

44.6 

62.1 

50.5 

49.5 

1.02 

SARS-CoV 3CL pro Ser'Ala 

43.4 

62.8 

51.9 

48.1 

1.08 

SARS-CoV 3CL pro Phe 3 Ala 

44.8 

63.4 

48.2 

51.8 

0.93 

SARS-CoV 3CL pro Arg 4 Ala 

44.8 

60.4 

31.1 

68.9 

0.45 

SARS-CoV 3CL pro Ser 10 Ala 

45.8 

60.2 

39.8 

60.2 

0.66 

SARS-CoV 3CL pro Glu 14 Ala 

44.4 

62.4 

26.5 

73.5 

0.36 

SARS-CoV 3CL pro Ser 139 Ala 

43.8 

59.2 

44.8 

55.2 

0.81 

SARS-CoV 3CL pro Phe 140 Ala 

44.2 

60.6 

38.7 

61.3 

0.63 


“The percentage of dimers (D) and monomers (M) was estimated by deconvolution of the corresponding SEC elution profiles. 


both forms of monomer and dimer and the amount of the 
monomer is almost equal to that of the dimeric form, in 
well agreement with the chemical cross-linking analysis 
and literature report (13). 

For the seven mutated proteases under identical 
conditions, the two elution-peaks representing the 
dimer and monomer states were also monitored, respec¬ 
tively (Fig. 6B-H). The results demonstrate that dimer¬ 
ization of SARS-CoV 3CL pro could not be disrupted 
entirely by mutation of single residue on the dimer 
interface, further supporting the chemical cross-linking 
results. Compared with the wild type protease, these 
mutants showed minor drifts on the retention volumes of 
the two elution peaks (Table 3, varying from 43.4 ml to 
45.8 ml and from 59.2 ml to 63.4 ml), indicative of possible 
subtle conformational changes of the dimer and monomer 
structures. Furthermore, the dimer/monomer ratios of 
these mutants differentiated significantly from each 
other (Table 3), implying that the contributions of these 
residues to the monomer-dimer equilibrium of SARS- 
CoV 3CL pro are quite different. For the Serl_Ala, 
Phe3_Ala and Serl39_Ala mutants, the ratios between 
the dimers and monomers were 1.08, 0.93 and 0.81, 
respectively, which indicates that these three residues 
could only affect the dimer interface stability to a lesser 
extent. For the other mutants, especially the Arg4_Ala 
and Glul4_Ala mutants, the dimer/monomer ratios 
decreased obviously and were nearly 2 to 3-fold lower 
than that of the wild type protease, suggesting that the 
amount of the dimer has decreased and the monomer is 
the predominant form. Overall, Glul4, Arg4, Phel40 and 
SerlO (in decreasing order) on the dimer interface are 
the relatively more critical residues for stabilizing the 
dimeric structure of SARS-CoV 3CL pro . 

Enzymatic Activity Assays of the Seven Mutated 
Proteases —Several published results have proposed that 
only the dimer should be the biological functional form of 
SARS-CoV 3CL pro and the dissociated monomer might be 
enzymatic inactive (13, 30). Meanwhile, alteration of the 
correct conformation of the dimeric structure could also 
lead to a complete loss of the catalytic activity (8, 14). 
Although Ala replacement of single residue on the dimer 
interface could not completely result in the dimer 
dissociation in solution, the seven residues we selected 
still might affect the catalytic activity of SARS-CoV 
3CL pro considering their contributions to stabilize the 
monomer-monomer interface. To verify this prediction, 
we determined the enzymatic activities of the wild type 
and seven mutated proteases by a fluorogenic substrate 



Fig. 7. Fluorescence profiles of hydrolysis of the fluoro¬ 
genic substrate by the wild type and site-directed 
mutants for SARS-CoV 3CL pro . The fluorogenic substrate at 
a concentration of 10 pM was incubated with 1 pM wild type or 
mutated SARS-CoV 3CL pro in 20 mM Tris-HCl pH7.5, 100 mM 
NaCl, 5mM DTT, ImM EDTA, at 25"C. Increase of emission 
fluorescence intensity at 488 nm wavelength was recorded at 
10 min intervals, X EX = 340nm. The emission spectrum was 
recorded for 90 min and the activity of the wild type protease 
was taken as 100%. 

reported previously in our lab (23, 24). The catalytic 
activity of SARS-CoV 3CL pro and relevant inhibitors 
screening have been characterized extensively by the 
FRET-based assay (26, 31-33). As shown in Fig. 7, the 
fluorescence increase following hydrolysis of the sub¬ 
strate by the wild type SARS-CoV 3CL pro is significant 
and time-dependent, implying that the protease could 
hydrolyze the substrate efficiently. As expected, the 
fluorescence profiles of the seven mutants were obviously 
different from that of the wild type protease (Fig. 7), 
which indicates that mutation of these residues could 
inactivate the catalytic activity of SARS-CoV 3CL pro to 
varying extents. In detail, mutation of residues SerlO 
and Phel40 almost produced the complete loss of the 
enzymatic activity and the catalytic activities of the 
Phe3_Ala, Glul4_Ala and Arg4_Ala mutants were also 
decreased to only 4-10% of that of the wild type protease 
(Table 4). While the mutants of Serl_Ala and Serl39_Ala 
still possessed 46 and 58% of enzymatic activity, 
respectively. (Table 4). These results further support 
the conclusions derived from the SEC analyses that the 
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Table 4. Enzymatic activities of the wild type and site- 
directed mutants for SARS-CoV 3CL pro . a 


Protein 

Proteolytic activity (%) 

WT SARS-CoV 3CL pro 

100 

SARS-CoV 3CL pro Ser'Ala 

46 

SARS-CoV 3CL pro Phe 3 Ala 

5 

SARS-CoV 3CL pro Arg 4 Ala 

10 

SARS-CoV 3CL pro Ser 10 Ala 

<1 

SARS-CoV 3CL pro Glu 14 Ala 

4 

SARS-CoV 3CL pro Ser 139 Ala 

58 

SARS-CoV 3CL pro Phe 140 Ala 

<1 


“Enzymatic activities were averages determined in three parallel 
experiments. The activity of wild type SARS-CoV 3CL pro was taken 
as 100%. 


extensive monomer-monomer interactions regulated by 
these residues could stabilize the dimeric structure at 
different degrees. However, it is noticeable that the 
influence of these mutations on the catalytic activity is 
more obvious than that on dimerization of SARS-CoV 
3CL pro , which will be discussed below. 

DISCUSSION 

Proteolytic processing of the non-structural polyproteins 
is a vital step in the replication cycle of coronavirus, and 
such processing is commonly performed by virus-genome 
encoded proteases including 3CL pro ( 4, 34-36). Therefore 
3CL pro has been appreciated as an attractive target in 
discovering anti-coronavirus agents (5). SARS-CoV 
3CL pro shares high homology with the 3CL pro s of other 
coronaviruses, and the 3D structures of different coro¬ 
navirus 3CL pro s are more conserved than their sequences 
(6, 37). SARS-CoV 3CL pro has been extensively charac¬ 
terized for its structural property and enzymatic activity 
(8, 9, 12-15, 27, 38, 39). The protease can form a 
homodimer in crystal and solution (Fig. 1A and B), and 
the dimeric structure is proposed to be indispensable for 
enzymatic activity. Much progress has been made for 
understanding the correlation between dimerization and 
catalytic activity of SARS-CoV 3CL pro {7-10, 12, 27, 30). 
Recently a systematic mutagenesis study reported an 
initial attempt to map the dimerization interface on the 
helical domain III of the protease {11). 

In the present study, we focused another seven 
residues on the dimer interface of SARS-CoV 3CL pro for 
single point mutagenesis (Fig. 1A). These selected 
residues are predicted to involve in dimerization mainly 
through hydrogen bonding (Table 2 and Fig. 2). 
Structurally, the seven mutated proteases could be 
divided into three groups. The first group includes 
Serl_Ala, Phe3_Ala and Arg4_Ala mutants regarding 
three residues on the N-terminal finger of SARS-CoV 
3CL pro . The N-terminal finger of one monomer can form 
intensive interactions with domain II of the other 
monomer (6) (Fig. 2A), e.g. the NH group of Seri in 
monomer B donates hydrogen bonds to the main-chain 
carbonyl of Phel40 in monomer A, as well as the side- 
chain carboxylate of Glul66A; and the side-chain OH 
group of SerlB forms a hydrogen bond with the main- 
chain NH group of Phel40A. In addition, the NH group 
of Phe3B donates a hydrogen bond to the side-chain OH 


group of Serl39A. This pair of hydrogen bond might be 
stabilized by hydrophobic interactions between the side- 
chain phenyl of Phe3B and the neighbouring residues, 
e.g. Leu282B and Phe291B. Whereas replacement of 
residue Seri or Phe3 by Ala rendered little influence on 
the dimer-monomer equilibrium of SARS-CoV 3CL pro 
(Fig. 5B, C and Table 3), indicating that these two 
residues might not play a vital role in dimerization. The 
results are in agreement with a reported study that the 
N-terminal residuesl-3 truncated protease still exhibits 
a tendency to form dimer (9). The Serl_Ala mutant 
maintained 46% of enzymatic activity and the activity of 
the Phe3_Ala mutant was nearly 10-fold lower than that 
of the wild type protease (Table 4), implying that in 
addition to dimerization, residues Seri and Phe3 could 
regulate the catalytic activity of the protease by other 
mechanisms. According to the crystal structure (Fig. 2A), 
the monomer-monomer interactions mediated by Seri 
and Phe3 might be helpful to maintain the correct 
catalytic conformation of the SI subsite, and mutation of 
Seri or Phe3 possibly induces an altered uncompetitive 
conformation of the SI subsite. Nevertheless, this 
hypothesis should be verified by the crystal structures 
determination of these two mutated proteases. Besides 
Seri and Phe3, another residue Arg4 was also selected 
for mutagenesis study. The side-chain guanidyl of Arg4 
in one monomer forms a salt bridge with the side-chain 
of Glu290 in the other monomer (Fig. 2A), which has 
been reported as one of the major interactions between 
the two monomers {12). Here, Arg4_Ala mutant was 
shown to a tendency to monomer state (Table 3) and a 
very weak enzymatic activity (Table 4), further demon¬ 
strating the importance of the Arg4-mediated interac¬ 
tions in the quaternary structure and activity of the 
protease. Although the role of the N-terminal finger has 
been assessed by many investigations (9, 10), our results 
revealed that the residues on the N-terminal finger 
indeed contribute differently to the dimer stability and 
catalytic activity of SARS-CoV 3CL pro . 

The second group of the mutants is about SerlO_Ala 
and Glul4_Ala, which are related to two residues on the 
ot-helix A' of domain I of SARS-CoV 3CL pro . The two 
residues are highly conserved among different corona- 
virus 3CL proteases and also extensively involved in 
monomer-monomer interactions (Fig. 1C). Residue SerlO 
from each monomer can form a pair of hydrogen bond 
between the main-chain NH group and the side-chain 
OH group, and the side-chain carboxylate of Glul4 in one 
monomer donates a hydrogen bond with the main-chain 
NH group of Glyll in the other monomer (Fig. 2B). In 
the present study, both of the two mutants were shown 
to weak dimerization (Fig. 5E-F and Table 3) and have 
no detectable enzyme activity either (Table 4), indicating 
that the a-helix A' of domain I might also be a critical 
region for dimerization. Structurally, the a-helix A' 
(residues Serl0-Glyl5) connects to the N-terminal 
finger of SARS-CoV 3CL pro and might determine the 
correct spatial orientation of the N-terminal finger 
(Fig. 2B). In the dimer structure, the N-terminal finger 
can squeeze into the space between domain III of its 
parent monomer and domain II of the neighbouring 
monomer, which is indispensable for maintaining the 
correct catalytic conformation of the protease {6, 8). 
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Mutation of SerlO or Glul4 to Ala is possible to partly 
disrupt the structure of the a-helix A' and produce a mis- 
oriented N-terminal finger, thus making the protease 
completely inactive. However, this conclusion should be 
further confirmed by the crystal structures of the 
corresponding mutated proteases (unpublished data 
from this laboratory). 

In addition, we also performed another group of 
mutants including Serl39_Ala and Phel40_Ala. These 
two residues are located in the oxyanion loop (residues 
138-145) of domain II and involved in the dimer 
interface by interacting with the N-terminus residues of 
the other monomer (Fig. 2A). The oxyanion loop is 
associated with the formation of the SI subsite in the 
substrate-binding pocket, which determines the absolute 
specificity of SARS-CoV 3CL pro for Glu in the PI position 
of the substrate ( 6 , 14). The oxyanion loop is very flexible 
and a rearrangement of its correct conformation could 
induce the collapse of the oxyanion hole (Glyl43-Serl44- 
Cysl45) in the SI subsite, therefore, inactivate the 
protease completely (14). The Serl39_Ala mutant 
showed only a minor difference in the monomer-dimer 
equilibrium with the wild type protease (Table 3), 
implying that the contribution of residue Serl39 to 
dimerization is not dominant. While the Serl39_Ala 
mutant preserved only 50% of the wild type activity 
(Table 4), which indicates that mutation of Serl39 might 
directly affect the catalysis, most probably by altering the 
conformation of the oxyanion loop. Although Phel40 
donated hydrogen bonds to Seri through its main-chain 
groups, the Phel40_Ala mutant still had an obvious trend 
to the monomer state (Table 3). It is possible that the 
hydrophobic packing between the side-chain phenyl of 
Phel40 and the residues nearby ( e.g. Hisl63, Hisl72) 
would also stabilize the Phel40-Serl interactions 
(Fig. 2A). Meanwhile, mutation of Phel40 completely 
abolished the proteolytic activity of SARS-CoV 3CL pro 
(Table 4), further suggesting that the interactions 
mediated by Phel40 might also contribute well to maintain 
the conformational stability of the SI subsite (14). 

In summary, our study characterized the contributions 
of several previously unidentified residues to the dimer 
stability and catalytic activity of SARS-CoV 3CL pro . Since 
the dimeric structure has been proved to be indispen¬ 
sable for enzymatic activity of SARS-CoV 3CL pro , it is 
easy to understand the conclusion of no dimer, no 
activity. In this study, some residues have been revealed 
to be important for both dimerization and activity. 
Meanwhile, SARS-CoV 3CL pro is a very flexible protein 
and the correct conformation state might also be vital 
for the protease to maintain its full activity. Thus, in 
addition to dimer dissociation, an altered conformation of 
the substrate-binding pocket possibly induced by single 
mutation on the dimer interface could also make the 
protease inactive. Our future study should be focused on 
determining the crystal structures of these mutated 
proteases, which will shed more light on understanding 
the dimerization-activity relationship of SARS-CoV 
3CL pro . 
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